Dataset statistics
| Number of variables | 41 |
|---|---|
| Number of observations | 8522 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 2.7 MiB |
| Average record size in memory | 328.0 B |
Variable types
| Numeric | 18 |
|---|---|
| Categorical | 23 |
fr_Al_COO is highly correlated with fr_COO and 1 other fields | High correlation |
fr_Al_OH is highly correlated with fr_Al_OH_noTert | High correlation |
fr_Al_OH_noTert is highly correlated with fr_Al_OH | High correlation |
fr_COO is highly correlated with fr_Al_COO and 2 other fields | High correlation |
fr_COO2 is highly correlated with fr_Al_COO and 2 other fields | High correlation |
fr_C_O is highly correlated with fr_C_O_noCOO and 1 other fields | High correlation |
fr_C_O_noCOO is highly correlated with fr_C_O and 2 other fields | High correlation |
fr_amide is highly correlated with fr_C_O and 2 other fields | High correlation |
fr_ArN is highly correlated with fr_NH2 | High correlation |
fr_Ar_NH is highly correlated with fr_NH1 and 1 other fields | High correlation |
fr_NH2 is highly correlated with fr_ArN | High correlation |
fr_Nhpyrrole is highly correlated with fr_Ar_NH and 1 other fields | High correlation |
fr_Ar_COO is highly correlated with fr_COO and 1 other fields | High correlation |
fr_NH1 is highly correlated with fr_Ar_NH and 1 other fields | High correlation |
fr_Ndealkylation1 is highly correlated with fr_C_O_noCOO and 1 other fields | High correlation |
fr_N_O is highly skewed (γ1 = 23.02680668) | Skewed |
df_index has unique values | Unique |
fr_Al_COO has 8011 (94.0%) zeros | Zeros |
fr_Al_OH has 7737 (90.8%) zeros | Zeros |
fr_Al_OH_noTert has 7823 (91.8%) zeros | Zeros |
fr_Ar_OH has 8098 (95.0%) zeros | Zeros |
fr_COO has 7839 (92.0%) zeros | Zeros |
fr_COO2 has 7837 (92.0%) zeros | Zeros |
fr_C_O has 3445 (40.4%) zeros | Zeros |
fr_C_O_noCOO has 3859 (45.3%) zeros | Zeros |
fr_N_O has 8492 (99.6%) zeros | Zeros |
fr_alkyl_halide has 8095 (95.0%) zeros | Zeros |
fr_allylic_oxid has 8018 (94.1%) zeros | Zeros |
fr_amide has 4824 (56.6%) zeros | Zeros |
fr_aniline has 4762 (55.9%) zeros | Zeros |
fr_aryl_methyl has 5982 (70.2%) zeros | Zeros |
fr_benzene has 1533 (18.0%) zeros | Zeros |
fr_bicyclic has 4524 (53.1%) zeros | Zeros |
fr_ether has 4596 (53.9%) zeros | Zeros |
Reproduction
| Analysis started | 2022-11-04 07:15:33.239492 |
|---|---|
| Analysis finished | 2022-11-04 07:16:38.694353 |
| Duration | 1 minute and 5.45 seconds |
| Software version | pandas-profiling v3.4.0 |
| Download configuration | config.json |
| Distinct | 8522 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6220.515372 |
| Minimum | 0 |
|---|---|
| Maximum | 12664 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 66.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 591.05 |
| Q1 | 3023.25 |
| median | 6158 |
| Q3 | 9418.5 |
| 95-th percentile | 11950.95 |
| Maximum | 12664 |
| Range | 12664 |
| Interquartile range (IQR) | 6395.25 |
Descriptive statistics
| Standard deviation | 3657.676603 |
|---|---|
| Coefficient of variation (CV) | 0.5880021805 |
| Kurtosis | -1.216095087 |
| Mean | 6220.515372 |
| Median Absolute Deviation (MAD) | 3197 |
| Skewness | 0.0250161244 |
| Sum | 53011232 |
| Variance | 13378598.13 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5124 | 1 | < 0.1% |
| 9906 | 1 | < 0.1% |
| 6421 | 1 | < 0.1% |
| 6402 | 1 | < 0.1% |
| 9661 | 1 | < 0.1% |
| 2744 | 1 | < 0.1% |
| 5949 | 1 | < 0.1% |
| 4309 | 1 | < 0.1% |
| 7760 | 1 | < 0.1% |
| 2317 | 1 | < 0.1% |
| Other values (8512) | 8512 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 8 | 1 | |
| 10 | 1 | |
| 12 | 1 | |
| 14 | 1 | |
| 15 | 1 | |
| 17 | 1 | |
| 18 | 1 |
| Value | Count | Frequency (%) |
| 12664 | 1 | |
| 12663 | 1 | |
| 12661 | 1 | |
| 12660 | 1 | |
| 12659 | 1 | |
| 12658 | 1 | |
| 12657 | 1 | |
| 12656 | 1 | |
| 12654 | 1 | |
| 12653 | 1 |
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.07111006806 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 8011 |
| Zeros (%) | 94.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 66.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.3100074197 |
|---|---|
| Coefficient of variation (CV) | 4.359543285 |
| Kurtosis | 46.43452819 |
| Mean | 0.07111006806 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.776962917 |
| Sum | 606 |
| Variance | 0.09610460025 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 8011 | |
| 1 | 438 | 5.1% |
| 2 | 61 | 0.7% |
| 4 | 8 | 0.1% |
| 3 | 3 | < 0.1% |
| 5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 8011 | |
| 1 | 438 | 5.1% |
| 2 | 61 | 0.7% |
| 3 | 3 | < 0.1% |
| 4 | 8 | 0.1% |
| 5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 5 | 1 | < 0.1% |
| 4 | 8 | 0.1% |
| 3 | 3 | < 0.1% |
| 2 | 61 | 0.7% |
| 1 | 438 | 5.1% |
| 0 | 8011 |
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1378784323 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 7737 |
| Zeros (%) | 90.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 66.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.4918050219 |
|---|---|
| Coefficient of variation (CV) | 3.566946721 |
| Kurtosis | 23.03846342 |
| Mean | 0.1378784323 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.411191209 |
| Sum | 1175 |
| Variance | 0.2418721796 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 7737 | |
| 1 | 497 | 5.8% |
| 2 | 213 | 2.5% |
| 3 | 53 | 0.6% |
| 4 | 17 | 0.2% |
| 5 | 5 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 7737 | |
| 1 | 497 | 5.8% |
| 2 | 213 | 2.5% |
| 3 | 53 | 0.6% |
| 4 | 17 | 0.2% |
| 5 | 5 | 0.1% |
| Value | Count | Frequency (%) |
| 5 | 5 | 0.1% |
| 4 | 17 | 0.2% |
| 3 | 53 | 0.6% |
| 2 | 213 | 2.5% |
| 1 | 497 | 5.8% |
| 0 | 7737 |
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1219197372 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 7823 |
| Zeros (%) | 91.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 66.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.4601634792 |
|---|---|
| Coefficient of variation (CV) | 3.774314889 |
| Kurtosis | 25.38504456 |
| Mean | 0.1219197372 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.630351152 |
| Sum | 1039 |
| Variance | 0.2117504275 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 7823 | |
| 1 | 441 | 5.2% |
| 2 | 196 | 2.3% |
| 3 | 46 | 0.5% |
| 4 | 12 | 0.1% |
| 5 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 7823 | |
| 1 | 441 | 5.2% |
| 2 | 196 | 2.3% |
| 3 | 46 | 0.5% |
| 4 | 12 | 0.1% |
| 5 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 5 | 4 | < 0.1% |
| 4 | 12 | 0.1% |
| 3 | 46 | 0.5% |
| 2 | 196 | 2.3% |
| 1 | 441 | 5.2% |
| 0 | 7823 |
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 66.7 KiB |
| 0 | |
|---|---|
| 1 | 309 |
| 2 | 32 |
| 3 | 5 |
| 4 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8522 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 8175 | |
| 1 | 309 | 3.6% |
| 2 | 32 | 0.4% |
| 3 | 5 | 0.1% |
| 4 | 1 | < 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 8175 | |
| 1 | 309 | 3.6% |
| 2 | 32 | 0.4% |
| 3 | 5 | 0.1% |
| 4 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 8175 | |
| 1 | 309 | 3.6% |
| 2 | 32 | 0.4% |
| 3 | 5 | 0.1% |
| 4 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8522 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 8175 | |
| 1 | 309 | 3.6% |
| 2 | 32 | 0.4% |
| 3 | 5 | 0.1% |
| 4 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8522 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 8175 | |
| 1 | 309 | 3.6% |
| 2 | 32 | 0.4% |
| 3 | 5 | 0.1% |
| 4 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8522 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 8175 | |
| 1 | 309 | 3.6% |
| 2 | 32 | 0.4% |
| 3 | 5 | 0.1% |
| 4 | 1 | < 0.1% |
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 66.7 KiB |
| 0 | |
|---|---|
| 1 | 161 |
| 2 | 23 |
| 3 | 2 |
| 4 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8522 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 8335 | |
| 1 | 161 | 1.9% |
| 2 | 23 | 0.3% |
| 3 | 2 | < 0.1% |
| 4 | 1 | < 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 8335 | |
| 1 | 161 | 1.9% |
| 2 | 23 | 0.3% |
| 3 | 2 | < 0.1% |
| 4 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 8335 | |
| 1 | 161 | 1.9% |
| 2 | 23 | 0.3% |
| 3 | 2 | < 0.1% |
| 4 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8522 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 8335 | |
| 1 | 161 | 1.9% |
| 2 | 23 | 0.3% |
| 3 | 2 | < 0.1% |
| 4 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8522 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 8335 | |
| 1 | 161 | 1.9% |
| 2 | 23 | 0.3% |
| 3 | 2 | < 0.1% |
| 4 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8522 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 8335 | |
| 1 | 161 | 1.9% |
| 2 | 23 | 0.3% |
| 3 | 2 | < 0.1% |
| 4 | 1 | < 0.1% |
fr_HOCCN
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 66.7 KiB |
| 0 | |
|---|---|
| 1 | 41 |
| 2 | 1 |
| 3 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8522 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 8479 | |
| 1 | 41 | 0.5% |
| 2 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 8479 | |
| 1 | 41 | 0.5% |
| 2 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 8479 | |
| 1 | 41 | 0.5% |
| 2 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8522 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 8479 | |
| 1 | 41 | 0.5% |
| 2 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8522 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 8479 | |
| 1 | 41 | 0.5% |
| 2 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8522 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 8479 | |
| 1 | 41 | 0.5% |
| 2 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.06841117109 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 8098 |
| Zeros (%) | 95.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 66.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.3361971914 |
|---|---|
| Coefficient of variation (CV) | 4.914361003 |
| Kurtosis | 50.67120759 |
| Mean | 0.06841117109 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.334491027 |
| Sum | 583 |
| Variance | 0.1130285515 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 8098 | |
| 1 | 301 | 3.5% |
| 2 | 100 | 1.2% |
| 3 | 12 | 0.1% |
| 4 | 9 | 0.1% |
| 5 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 8098 | |
| 1 | 301 | 3.5% |
| 2 | 100 | 1.2% |
| 3 | 12 | 0.1% |
| 4 | 9 | 0.1% |
| 5 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 5 | 2 | < 0.1% |
| 4 | 9 | 0.1% |
| 3 | 12 | 0.1% |
| 2 | 100 | 1.2% |
| 1 | 301 | 3.5% |
| 0 | 8098 |
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 66.7 KiB |
| 0 | |
|---|---|
| 1 | 720 |
| 2 | 54 |
| 4 | 3 |
| 3 | 3 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8522 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 7742 | |
| 1 | 720 | 8.4% |
| 2 | 54 | 0.6% |
| 4 | 3 | < 0.1% |
| 3 | 3 | < 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 7742 | |
| 1 | 720 | 8.4% |
| 2 | 54 | 0.6% |
| 4 | 3 | < 0.1% |
| 3 | 3 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 7742 | |
| 1 | 720 | 8.4% |
| 2 | 54 | 0.6% |
| 4 | 3 | < 0.1% |
| 3 | 3 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8522 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 7742 | |
| 1 | 720 | 8.4% |
| 2 | 54 | 0.6% |
| 4 | 3 | < 0.1% |
| 3 | 3 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8522 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 7742 | |
| 1 | 720 | 8.4% |
| 2 | 54 | 0.6% |
| 4 | 3 | < 0.1% |
| 3 | 3 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8522 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 7742 | |
| 1 | 720 | 8.4% |
| 2 | 54 | 0.6% |
| 4 | 3 | < 0.1% |
| 3 | 3 | < 0.1% |
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.09657357428 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 7839 |
| Zeros (%) | 92.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 66.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.359543052 |
|---|---|
| Coefficient of variation (CV) | 3.72299622 |
| Kurtosis | 30.36310473 |
| Mean | 0.09657357428 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.753597433 |
| Sum | 823 |
| Variance | 0.1292712062 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 7839 | |
| 1 | 570 | 6.7% |
| 2 | 97 | 1.1% |
| 4 | 9 | 0.1% |
| 3 | 6 | 0.1% |
| 5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 7839 | |
| 1 | 570 | 6.7% |
| 2 | 97 | 1.1% |
| 3 | 6 | 0.1% |
| 4 | 9 | 0.1% |
| 5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 5 | 1 | < 0.1% |
| 4 | 9 | 0.1% |
| 3 | 6 | 0.1% |
| 2 | 97 | 1.1% |
| 1 | 570 | 6.7% |
| 0 | 7839 |
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.09680826097 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 7837 |
| Zeros (%) | 92.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 66.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.3598062409 |
|---|---|
| Coefficient of variation (CV) | 3.716689437 |
| Kurtosis | 30.26253802 |
| Mean | 0.09680826097 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.744937291 |
| Sum | 825 |
| Variance | 0.129460531 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 7837 | |
| 1 | 572 | 6.7% |
| 2 | 97 | 1.1% |
| 4 | 9 | 0.1% |
| 3 | 6 | 0.1% |
| 5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 7837 | |
| 1 | 572 | 6.7% |
| 2 | 97 | 1.1% |
| 3 | 6 | 0.1% |
| 4 | 9 | 0.1% |
| 5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 5 | 1 | < 0.1% |
| 4 | 9 | 0.1% |
| 3 | 6 | 0.1% |
| 2 | 97 | 1.1% |
| 1 | 572 | 6.7% |
| 0 | 7837 |
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.8900492842 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 3445 |
| Zeros (%) | 40.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 66.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 3 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.9215168518 |
|---|---|
| Coefficient of variation (CV) | 1.03535486 |
| Kurtosis | 0.8211137976 |
| Mean | 0.8900492842 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.9661923124 |
| Sum | 7585 |
| Variance | 0.8491933082 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 3445 | |
| 1 | 3146 | |
| 2 | 1452 | |
| 3 | 396 | 4.6% |
| 4 | 71 | 0.8% |
| 5 | 9 | 0.1% |
| 6 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 3445 | |
| 1 | 3146 | |
| 2 | 1452 | |
| 3 | 396 | 4.6% |
| 4 | 71 | 0.8% |
| 5 | 9 | 0.1% |
| 6 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 6 | 3 | < 0.1% |
| 5 | 9 | 0.1% |
| 4 | 71 | 0.8% |
| 3 | 396 | 4.6% |
| 2 | 1452 | |
| 1 | 3146 | |
| 0 | 3445 |
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.7992255339 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 3859 |
| Zeros (%) | 45.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 66.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.8918435503 |
|---|---|
| Coefficient of variation (CV) | 1.115884706 |
| Kurtosis | 0.9805873342 |
| Mean | 0.7992255339 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.06009802 |
| Sum | 6811 |
| Variance | 0.7953849183 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 3859 | |
| 1 | 2983 | |
| 2 | 1288 | 15.1% |
| 3 | 327 | 3.8% |
| 4 | 56 | 0.7% |
| 5 | 7 | 0.1% |
| 6 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 3859 | |
| 1 | 2983 | |
| 2 | 1288 | 15.1% |
| 3 | 327 | 3.8% |
| 4 | 56 | 0.7% |
| 5 | 7 | 0.1% |
| 6 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 6 | 2 | < 0.1% |
| 5 | 7 | 0.1% |
| 4 | 56 | 0.7% |
| 3 | 327 | 3.8% |
| 2 | 1288 | 15.1% |
| 1 | 2983 | |
| 0 | 3859 |
fr_C_S
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 66.7 KiB |
| 0 | |
|---|---|
| 1 | 229 |
| 4 | 1 |
| 2 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8522 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 8291 | |
| 1 | 229 | 2.7% |
| 4 | 1 | < 0.1% |
| 2 | 1 | < 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 8291 | |
| 1 | 229 | 2.7% |
| 4 | 1 | < 0.1% |
| 2 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 8291 | |
| 1 | 229 | 2.7% |
| 4 | 1 | < 0.1% |
| 2 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8522 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 8291 | |
| 1 | 229 | 2.7% |
| 4 | 1 | < 0.1% |
| 2 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8522 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 8291 | |
| 1 | 229 | 2.7% |
| 4 | 1 | < 0.1% |
| 2 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8522 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 8291 | |
| 1 | 229 | 2.7% |
| 4 | 1 | < 0.1% |
| 2 | 1 | < 0.1% |
fr_Imine
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 66.7 KiB |
| 0 | |
|---|---|
| 1 | 260 |
| 2 | 27 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8522 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 8235 | |
| 1 | 260 | 3.1% |
| 2 | 27 | 0.3% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 8235 | |
| 1 | 260 | 3.1% |
| 2 | 27 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 8235 | |
| 1 | 260 | 3.1% |
| 2 | 27 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8522 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 8235 | |
| 1 | 260 | 3.1% |
| 2 | 27 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8522 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 8235 | |
| 1 | 260 | 3.1% |
| 2 | 27 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8522 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 8235 | |
| 1 | 260 | 3.1% |
| 2 | 27 | 0.3% |
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 66.7 KiB |
| 0 | |
|---|---|
| 1 | |
| 2 | |
| 3 | 134 |
| 4 | 9 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8522 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 0 |
| 4th row | 2 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 3892 | |
| 1 | 3469 | |
| 2 | 1018 | 11.9% |
| 3 | 134 | 1.6% |
| 4 | 9 | 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 3892 | |
| 1 | 3469 | |
| 2 | 1018 | 11.9% |
| 3 | 134 | 1.6% |
| 4 | 9 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3892 | |
| 1 | 3469 | |
| 2 | 1018 | 11.9% |
| 3 | 134 | 1.6% |
| 4 | 9 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8522 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3892 | |
| 1 | 3469 | |
| 2 | 1018 | 11.9% |
| 3 | 134 | 1.6% |
| 4 | 9 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8522 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3892 | |
| 1 | 3469 | |
| 2 | 1018 | 11.9% |
| 3 | 134 | 1.6% |
| 4 | 9 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8522 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3892 | |
| 1 | 3469 | |
| 2 | 1018 | 11.9% |
| 3 | 134 | 1.6% |
| 4 | 9 | 0.1% |
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 66.7 KiB |
| 0 | |
|---|---|
| 1 | 711 |
| 2 | 107 |
| 3 | 22 |
| 4 | 6 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8522 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 7676 | |
| 1 | 711 | 8.3% |
| 2 | 107 | 1.3% |
| 3 | 22 | 0.3% |
| 4 | 6 | 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 7676 | |
| 1 | 711 | 8.3% |
| 2 | 107 | 1.3% |
| 3 | 22 | 0.3% |
| 4 | 6 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 7676 | |
| 1 | 711 | 8.3% |
| 2 | 107 | 1.3% |
| 3 | 22 | 0.3% |
| 4 | 6 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8522 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 7676 | |
| 1 | 711 | 8.3% |
| 2 | 107 | 1.3% |
| 3 | 22 | 0.3% |
| 4 | 6 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8522 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 7676 | |
| 1 | 711 | 8.3% |
| 2 | 107 | 1.3% |
| 3 | 22 | 0.3% |
| 4 | 6 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8522 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 7676 | |
| 1 | 711 | 8.3% |
| 2 | 107 | 1.3% |
| 3 | 22 | 0.3% |
| 4 | 6 | 0.1% |
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.007627317531 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 8492 |
| Zeros (%) | 99.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 66.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.1422835147 |
|---|---|
| Coefficient of variation (CV) | 18.65446327 |
| Kurtosis | 646.8032748 |
| Mean | 0.007627317531 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 23.02680668 |
| Sum | 65 |
| Variance | 0.02024459856 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 8492 | |
| 2 | 15 | 0.2% |
| 1 | 7 | 0.1% |
| 3 | 6 | 0.1% |
| 6 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 8492 | |
| 1 | 7 | 0.1% |
| 2 | 15 | 0.2% |
| 3 | 6 | 0.1% |
| 4 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 6 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 3 | 6 | 0.1% |
| 2 | 15 | 0.2% |
| 1 | 7 | 0.1% |
| 0 | 8492 |
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 66.7 KiB |
| 0 | |
|---|---|
| 1 | 391 |
| 2 | 12 |
| 3 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8522 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 8118 | |
| 1 | 391 | 4.6% |
| 2 | 12 | 0.1% |
| 3 | 1 | < 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 8118 | |
| 1 | 391 | 4.6% |
| 2 | 12 | 0.1% |
| 3 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 8118 | |
| 1 | 391 | 4.6% |
| 2 | 12 | 0.1% |
| 3 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8522 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 8118 | |
| 1 | 391 | 4.6% |
| 2 | 12 | 0.1% |
| 3 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8522 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 8118 | |
| 1 | 391 | 4.6% |
| 2 | 12 | 0.1% |
| 3 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8522 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 8118 | |
| 1 | 391 | 4.6% |
| 2 | 12 | 0.1% |
| 3 | 1 | < 0.1% |
fr_Ndealkylation2
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 66.7 KiB |
| 0 | |
|---|---|
| 1 | |
| 2 | 327 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8522 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 7440 | |
| 1 | 755 | 8.9% |
| 2 | 327 | 3.8% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 7440 | |
| 1 | 755 | 8.9% |
| 2 | 327 | 3.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 7440 | |
| 1 | 755 | 8.9% |
| 2 | 327 | 3.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8522 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 7440 | |
| 1 | 755 | 8.9% |
| 2 | 327 | 3.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8522 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 7440 | |
| 1 | 755 | 8.9% |
| 2 | 327 | 3.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8522 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 7440 | |
| 1 | 755 | 8.9% |
| 2 | 327 | 3.8% |
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 66.7 KiB |
| 0 | |
|---|---|
| 1 | 720 |
| 2 | 54 |
| 4 | 3 |
| 3 | 3 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8522 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 7742 | |
| 1 | 720 | 8.4% |
| 2 | 54 | 0.6% |
| 4 | 3 | < 0.1% |
| 3 | 3 | < 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 7742 | |
| 1 | 720 | 8.4% |
| 2 | 54 | 0.6% |
| 4 | 3 | < 0.1% |
| 3 | 3 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 7742 | |
| 1 | 720 | 8.4% |
| 2 | 54 | 0.6% |
| 4 | 3 | < 0.1% |
| 3 | 3 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8522 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 7742 | |
| 1 | 720 | 8.4% |
| 2 | 54 | 0.6% |
| 4 | 3 | < 0.1% |
| 3 | 3 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8522 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 7742 | |
| 1 | 720 | 8.4% |
| 2 | 54 | 0.6% |
| 4 | 3 | < 0.1% |
| 3 | 3 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8522 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 7742 | |
| 1 | 720 | 8.4% |
| 2 | 54 | 0.6% |
| 4 | 3 | < 0.1% |
| 3 | 3 | < 0.1% |
fr_SH
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 66.7 KiB |
| 0 | |
|---|---|
| 1 | 10 |
| 2 | 2 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8522 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 8510 | |
| 1 | 10 | 0.1% |
| 2 | 2 | < 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 8510 | |
| 1 | 10 | 0.1% |
| 2 | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 8510 | |
| 1 | 10 | 0.1% |
| 2 | 2 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8522 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 8510 | |
| 1 | 10 | 0.1% |
| 2 | 2 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8522 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 8510 | |
| 1 | 10 | 0.1% |
| 2 | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8522 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 8510 | |
| 1 | 10 | 0.1% |
| 2 | 2 | < 0.1% |
fr_aldehyde
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 66.7 KiB |
| 0 | |
|---|---|
| 1 | 22 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8522 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 8500 | |
| 1 | 22 | 0.3% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 8500 | |
| 1 | 22 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 8500 | |
| 1 | 22 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8522 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 8500 | |
| 1 | 22 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8522 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 8500 | |
| 1 | 22 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8522 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 8500 | |
| 1 | 22 | 0.3% |
fr_alkyl_carbamate
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 66.7 KiB |
| 0 | |
|---|---|
| 1 | 48 |
| 2 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8522 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 8473 | |
| 1 | 48 | 0.6% |
| 2 | 1 | < 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 8473 | |
| 1 | 48 | 0.6% |
| 2 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 8473 | |
| 1 | 48 | 0.6% |
| 2 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8522 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 8473 | |
| 1 | 48 | 0.6% |
| 2 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8522 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 8473 | |
| 1 | 48 | 0.6% |
| 2 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8522 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 8473 | |
| 1 | 48 | 0.6% |
| 2 | 1 | < 0.1% |
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1674489556 |
| Minimum | 0 |
|---|---|
| Maximum | 12 |
| Zeros | 8095 |
| Zeros (%) | 95.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 66.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0.95 |
| Maximum | 12 |
| Range | 12 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.8039955921 |
|---|---|
| Coefficient of variation (CV) | 4.801436886 |
| Kurtosis | 38.42927419 |
| Mean | 0.1674489556 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.68233856 |
| Sum | 1427 |
| Variance | 0.6464089121 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 8095 | |
| 3 | 286 | 3.4% |
| 6 | 71 | 0.8% |
| 1 | 39 | 0.5% |
| 2 | 22 | 0.3% |
| 4 | 4 | < 0.1% |
| 12 | 2 | < 0.1% |
| 5 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 8095 | |
| 1 | 39 | 0.5% |
| 2 | 22 | 0.3% |
| 3 | 286 | 3.4% |
| 4 | 4 | < 0.1% |
| 5 | 1 | < 0.1% |
| 6 | 71 | 0.8% |
| 7 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 12 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 12 | 2 | < 0.1% |
| 8 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 6 | 71 | 0.8% |
| 5 | 1 | < 0.1% |
| 4 | 4 | < 0.1% |
| 3 | 286 | 3.4% |
| 2 | 22 | 0.3% |
| 1 | 39 | 0.5% |
| 0 | 8095 |
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.09633888759 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 8018 |
| Zeros (%) | 94.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 66.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.449269612 |
|---|---|
| Coefficient of variation (CV) | 4.663429517 |
| Kurtosis | 42.41708284 |
| Mean | 0.09633888759 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.001525375 |
| Sum | 821 |
| Variance | 0.2018431843 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 8018 | |
| 1 | 309 | 3.6% |
| 2 | 115 | 1.3% |
| 3 | 45 | 0.5% |
| 4 | 29 | 0.3% |
| 5 | 5 | 0.1% |
| 6 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 8018 | |
| 1 | 309 | 3.6% |
| 2 | 115 | 1.3% |
| 3 | 45 | 0.5% |
| 4 | 29 | 0.3% |
| 5 | 5 | 0.1% |
| 6 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 6 | 1 | < 0.1% |
| 5 | 5 | 0.1% |
| 4 | 29 | 0.3% |
| 3 | 45 | 0.5% |
| 2 | 115 | 1.3% |
| 1 | 309 | 3.6% |
| 0 | 8018 |
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.6084252523 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 4824 |
| Zeros (%) | 56.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 66.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.828598886 |
|---|---|
| Coefficient of variation (CV) | 1.361874582 |
| Kurtosis | 2.09559462 |
| Mean | 0.6084252523 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.444192169 |
| Sum | 5185 |
| Variance | 0.6865761139 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 4824 | |
| 1 | 2578 | |
| 2 | 798 | 9.4% |
| 3 | 286 | 3.4% |
| 4 | 29 | 0.3% |
| 5 | 5 | 0.1% |
| 6 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 4824 | |
| 1 | 2578 | |
| 2 | 798 | 9.4% |
| 3 | 286 | 3.4% |
| 4 | 29 | 0.3% |
| 5 | 5 | 0.1% |
| 6 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 6 | 2 | < 0.1% |
| 5 | 5 | 0.1% |
| 4 | 29 | 0.3% |
| 3 | 286 | 3.4% |
| 2 | 798 | 9.4% |
| 1 | 2578 | |
| 0 | 4824 |
fr_amidine
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 66.7 KiB |
| 0 | |
|---|---|
| 1 | 141 |
| 2 | 10 |
| 3 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8522 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 8370 | |
| 1 | 141 | 1.7% |
| 2 | 10 | 0.1% |
| 3 | 1 | < 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 8370 | |
| 1 | 141 | 1.7% |
| 2 | 10 | 0.1% |
| 3 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 8370 | |
| 1 | 141 | 1.7% |
| 2 | 10 | 0.1% |
| 3 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8522 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 8370 | |
| 1 | 141 | 1.7% |
| 2 | 10 | 0.1% |
| 3 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8522 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 8370 | |
| 1 | 141 | 1.7% |
| 2 | 10 | 0.1% |
| 3 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8522 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 8370 | |
| 1 | 141 | 1.7% |
| 2 | 10 | 0.1% |
| 3 | 1 | < 0.1% |
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5452945318 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 4762 |
| Zeros (%) | 55.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 66.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.7008148484 |
|---|---|
| Coefficient of variation (CV) | 1.285204248 |
| Kurtosis | 1.951784924 |
| Mean | 0.5452945318 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.264917454 |
| Sum | 4647 |
| Variance | 0.4911414518 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 4762 | |
| 1 | 2998 | |
| 2 | 656 | 7.7% |
| 3 | 91 | 1.1% |
| 4 | 12 | 0.1% |
| 5 | 2 | < 0.1% |
| 6 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 4762 | |
| 1 | 2998 | |
| 2 | 656 | 7.7% |
| 3 | 91 | 1.1% |
| 4 | 12 | 0.1% |
| 5 | 2 | < 0.1% |
| 6 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 6 | 1 | < 0.1% |
| 5 | 2 | < 0.1% |
| 4 | 12 | 0.1% |
| 3 | 91 | 1.1% |
| 2 | 656 | 7.7% |
| 1 | 2998 | |
| 0 | 4762 |
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4166862239 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 5982 |
| Zeros (%) | 70.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 66.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.7330886649 |
|---|---|
| Coefficient of variation (CV) | 1.759330217 |
| Kurtosis | 4.868258294 |
| Mean | 0.4166862239 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.99327738 |
| Sum | 3551 |
| Variance | 0.5374189906 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 5982 | |
| 1 | 1713 | 20.1% |
| 2 | 683 | 8.0% |
| 3 | 118 | 1.4% |
| 4 | 17 | 0.2% |
| 6 | 5 | 0.1% |
| 5 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 5982 | |
| 1 | 1713 | 20.1% |
| 2 | 683 | 8.0% |
| 3 | 118 | 1.4% |
| 4 | 17 | 0.2% |
| 5 | 4 | < 0.1% |
| 6 | 5 | 0.1% |
| Value | Count | Frequency (%) |
| 6 | 5 | 0.1% |
| 5 | 4 | < 0.1% |
| 4 | 17 | 0.2% |
| 3 | 118 | 1.4% |
| 2 | 683 | 8.0% |
| 1 | 1713 | 20.1% |
| 0 | 5982 |
fr_azide
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 66.7 KiB |
| 0 | |
|---|---|
| 1 | 5 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8522 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 8517 | |
| 1 | 5 | 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 8517 | |
| 1 | 5 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 8517 | |
| 1 | 5 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8522 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 8517 | |
| 1 | 5 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8522 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 8517 | |
| 1 | 5 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8522 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 8517 | |
| 1 | 5 | 0.1% |
fr_azo
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 66.7 KiB |
| 0 | |
|---|---|
| 1 | 45 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8522 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 8477 | |
| 1 | 45 | 0.5% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 8477 | |
| 1 | 45 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 8477 | |
| 1 | 45 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8522 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 8477 | |
| 1 | 45 | 0.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8522 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 8477 | |
| 1 | 45 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8522 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 8477 | |
| 1 | 45 | 0.5% |
fr_barbitur
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 66.7 KiB |
| 0 | |
|---|---|
| 1 | 7 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8522 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 8515 | |
| 1 | 7 | 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 8515 | |
| 1 | 7 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 8515 | |
| 1 | 7 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8522 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 8515 | |
| 1 | 7 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8522 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 8515 | |
| 1 | 7 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8522 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 8515 | |
| 1 | 7 | 0.1% |
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.329852147 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 1533 |
| Zeros (%) | 18.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 66.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.8782060224 |
|---|---|
| Coefficient of variation (CV) | 0.6603786926 |
| Kurtosis | -0.1171408996 |
| Mean | 1.329852147 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.2502593998 |
| Sum | 11333 |
| Variance | 0.7712458178 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 3368 | |
| 2 | 2979 | |
| 0 | 1533 | |
| 3 | 567 | 6.7% |
| 4 | 70 | 0.8% |
| 5 | 4 | < 0.1% |
| 6 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 1533 | |
| 1 | 3368 | |
| 2 | 2979 | |
| 3 | 567 | 6.7% |
| 4 | 70 | 0.8% |
| 5 | 4 | < 0.1% |
| 6 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 6 | 1 | < 0.1% |
| 5 | 4 | < 0.1% |
| 4 | 70 | 0.8% |
| 3 | 567 | 6.7% |
| 2 | 2979 | |
| 1 | 3368 | |
| 0 | 1533 |
fr_benzodiazepine
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 66.7 KiB |
| 0 | |
|---|---|
| 1 | 2 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8522 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 8520 | |
| 1 | 2 | < 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 8520 | |
| 1 | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 8520 | |
| 1 | 2 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8522 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 8520 | |
| 1 | 2 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8522 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 8520 | |
| 1 | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8522 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 8520 | |
| 1 | 2 | < 0.1% |
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.7514667918 |
| Minimum | 0 |
|---|---|
| Maximum | 9 |
| Zeros | 4524 |
| Zeros (%) | 53.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 66.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 3 |
| Maximum | 9 |
| Range | 9 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.111631364 |
|---|---|
| Coefficient of variation (CV) | 1.479282086 |
| Kurtosis | 6.423748916 |
| Mean | 0.7514667918 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.249447587 |
| Sum | 6404 |
| Variance | 1.235724289 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 4524 | |
| 1 | 2791 | |
| 2 | 528 | 6.2% |
| 3 | 401 | 4.7% |
| 5 | 122 | 1.4% |
| 4 | 108 | 1.3% |
| 6 | 29 | 0.3% |
| 7 | 15 | 0.2% |
| 8 | 3 | < 0.1% |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 4524 | |
| 1 | 2791 | |
| 2 | 528 | 6.2% |
| 3 | 401 | 4.7% |
| 4 | 108 | 1.3% |
| 5 | 122 | 1.4% |
| 6 | 29 | 0.3% |
| 7 | 15 | 0.2% |
| 8 | 3 | < 0.1% |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 9 | 1 | < 0.1% |
| 8 | 3 | < 0.1% |
| 7 | 15 | 0.2% |
| 6 | 29 | 0.3% |
| 5 | 122 | 1.4% |
| 4 | 108 | 1.3% |
| 3 | 401 | 4.7% |
| 2 | 528 | 6.2% |
| 1 | 2791 | |
| 0 | 4524 |
fr_diazo
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 66.7 KiB |
| 0 | |
|---|---|
| 1 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8522 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 8521 | |
| 1 | 1 | < 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 8521 | |
| 1 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 8521 | |
| 1 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8522 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 8521 | |
| 1 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8522 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 8521 | |
| 1 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8522 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 8521 | |
| 1 | 1 | < 0.1% |
fr_dihydropyridine
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 66.7 KiB |
| 0 | |
|---|---|
| 1 | 21 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8522 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 8501 | |
| 1 | 21 | 0.2% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 8501 | |
| 1 | 21 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 8501 | |
| 1 | 21 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8522 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 8501 | |
| 1 | 21 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8522 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 8501 | |
| 1 | 21 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8522 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 8501 | |
| 1 | 21 | 0.2% |
fr_epoxide
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 66.7 KiB |
| 0 | |
|---|---|
| 1 | 36 |
| 2 | 3 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8522 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 8483 | |
| 1 | 36 | 0.4% |
| 2 | 3 | < 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 8483 | |
| 1 | 36 | 0.4% |
| 2 | 3 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 8483 | |
| 1 | 36 | 0.4% |
| 2 | 3 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8522 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 8483 | |
| 1 | 36 | 0.4% |
| 2 | 3 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8522 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 8483 | |
| 1 | 36 | 0.4% |
| 2 | 3 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8522 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 8483 | |
| 1 | 36 | 0.4% |
| 2 | 3 | < 0.1% |
fr_ester
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 66.7 KiB |
| 0 | |
|---|---|
| 1 | |
| 2 | 146 |
| 3 | 12 |
| 4 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8522 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 7574 | |
| 1 | 789 | 9.3% |
| 2 | 146 | 1.7% |
| 3 | 12 | 0.1% |
| 4 | 1 | < 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0 | 7574 | |
| 1 | 789 | 9.3% |
| 2 | 146 | 1.7% |
| 3 | 12 | 0.1% |
| 4 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 7574 | |
| 1 | 789 | 9.3% |
| 2 | 146 | 1.7% |
| 3 | 12 | 0.1% |
| 4 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8522 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 7574 | |
| 1 | 789 | 9.3% |
| 2 | 146 | 1.7% |
| 3 | 12 | 0.1% |
| 4 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8522 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 7574 | |
| 1 | 789 | 9.3% |
| 2 | 146 | 1.7% |
| 3 | 12 | 0.1% |
| 4 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8522 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 7574 | |
| 1 | 789 | 9.3% |
| 2 | 146 | 1.7% |
| 3 | 12 | 0.1% |
| 4 | 1 | < 0.1% |
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.7033560197 |
| Minimum | 0 |
|---|---|
| Maximum | 8 |
| Zeros | 4596 |
| Zeros (%) | 53.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 66.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 8 |
| Range | 8 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.9441686958 |
|---|---|
| Coefficient of variation (CV) | 1.342376648 |
| Kurtosis | 3.176717289 |
| Mean | 0.7033560197 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.593580931 |
| Sum | 5994 |
| Variance | 0.8914545262 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 4596 | |
| 1 | 2454 | |
| 2 | 1061 | 12.5% |
| 3 | 274 | 3.2% |
| 4 | 98 | 1.1% |
| 5 | 32 | 0.4% |
| 6 | 6 | 0.1% |
| 8 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 4596 | |
| 1 | 2454 | |
| 2 | 1061 | 12.5% |
| 3 | 274 | 3.2% |
| 4 | 98 | 1.1% |
| 5 | 32 | 0.4% |
| 6 | 6 | 0.1% |
| 8 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 8 | 1 | < 0.1% |
| 6 | 6 | 0.1% |
| 5 | 32 | 0.4% |
| 4 | 98 | 1.1% |
| 3 | 274 | 3.2% |
| 2 | 1061 | 12.5% |
| 1 | 2454 | |
| 0 | 4596 |
Auto
The auto setting is an easily interpretable pairwise column metric of the following mapping: vartype-vartype : method, categorical-categorical : Cramer's V, numerical-categorical : Cramer's V (using a discretized numerical column), numerical-numerical : Spearman's ρ. This configuration uses the best suitable for each pair of columns.Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| df_index | fr_Al_COO | fr_Al_OH | fr_Al_OH_noTert | fr_ArN | fr_Ar_COO | fr_HOCCN | fr_Ar_OH | fr_Ar_NH | fr_COO | fr_COO2 | fr_C_O | fr_C_O_noCOO | fr_C_S | fr_Imine | fr_NH1 | fr_NH2 | fr_N_O | fr_Ndealkylation1 | fr_Ndealkylation2 | fr_Nhpyrrole | fr_SH | fr_aldehyde | fr_alkyl_carbamate | fr_alkyl_halide | fr_allylic_oxid | fr_amide | fr_amidine | fr_aniline | fr_aryl_methyl | fr_azide | fr_azo | fr_barbitur | fr_benzene | fr_benzodiazepine | fr_bicyclic | fr_diazo | fr_dihydropyridine | fr_epoxide | fr_ester | fr_ether | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 5124 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 2 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 1 | 8178 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 | 3 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 1 |
| 2 | 7753 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| 3 | 3392 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 1 |
| 4 | 1761 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| 5 | 6014 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 2 | 0 | 0 | 0 | 0 | 0 |
| 6 | 6047 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 | 0 | 1 | 0 | 0 | 0 | 0 | 2 |
| 7 | 6288 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 8 | 1012 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 1 | 0 | 0 | 0 | 0 | 1 |
| 9 | 2286 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 1 |
Last rows
| df_index | fr_Al_COO | fr_Al_OH | fr_Al_OH_noTert | fr_ArN | fr_Ar_COO | fr_HOCCN | fr_Ar_OH | fr_Ar_NH | fr_COO | fr_COO2 | fr_C_O | fr_C_O_noCOO | fr_C_S | fr_Imine | fr_NH1 | fr_NH2 | fr_N_O | fr_Ndealkylation1 | fr_Ndealkylation2 | fr_Nhpyrrole | fr_SH | fr_aldehyde | fr_alkyl_carbamate | fr_alkyl_halide | fr_allylic_oxid | fr_amide | fr_amidine | fr_aniline | fr_aryl_methyl | fr_azide | fr_azo | fr_barbitur | fr_benzene | fr_benzodiazepine | fr_bicyclic | fr_diazo | fr_dihydropyridine | fr_epoxide | fr_ester | fr_ether | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 8512 | 4615 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 2 | 2 | 2 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 8513 | 9434 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 1 | 0 | 0 | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| 8514 | 8731 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 1 | 0 | 0 | 0 | 0 | 2 |
| 8515 | 2006 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 2 |
| 8516 | 11103 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 | 3 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 1 | 0 | 0 | 0 | 0 | 1 |
| 8517 | 8769 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 1 | 1 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 1 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| 8518 | 5247 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 8519 | 3835 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 8520 | 10980 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 2 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 3 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 |
| 8521 | 3473 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 2 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 1 |